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well suited for digital data processing and/or storage systems encountering non-bursty, (i.e., substantially random), error patterns, such is 
characteristic of data storage and retrieval systems employing semiconductor based memory stores. 5-bit extension field operations, (i.e„ 
over a Galois field GF(2)), generated by using the irreducible polynomial P(x) - x + x + 1, over GF(2) ( are utilized to perform certain, 
requisite arithmetic functions over the Galois field GF(2) with a hardware-minimized error correction architecture. 
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DEDICATED ALU ARCHITECTURE FOR 10-BIT REED- SOLOMON 
ERROR CORRECTION MODULE 

Field of the Invention 
This invention pertains to the field of error correction in 
digital data processing and storage systems and, more particularly, 
to methods and apparatus for employing a 10-bit Reed-Solomon code 
for the detection and correction of digital data errors within a 
block of data being transmitted between elements of a data 
processing and/or storage system. 

Background of the Invention 
Error correction techniques and architectures are well known 
in digital data processing and communications systems, including 
systems having data storage subsystems such as magnetic, optical, 
or semiconductor based memory stores. Detection and, where 
possible, correction of erroneous data has been achieved by using 
an encoder circuit to construct some number of "redundant" m-bit 
error check symbols, which mathematically characterize the 
' information in a selected block of data. The error check symbols 
are then appended to the data block and transferred or stored 
therewith. When the data block is received, or later retrieved 
from memory, the accuracy of the data can be evaluated by use of 
these appended error check symbols. For example, one methodology 
is to repeat the encoding process on the received or retrieved 
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block of data, often using the same encoder, and then compare the 
newly derived error check symbols with those previously appended to 
the data block. if the newly derived error check symbols are 
identical to those appended to the data block, the information 
contained in the received data block is presumed to be error free; 
if not, "error syndrome" information is generated from the 
difference between the newly derived and originally appended error 
check symbols, which, depending upon the nature and amount of the 
error check information, can be used to locate the error (s) within 
the data block and determine the correct data values for 
substitution therefor. 

The use of Reed-Solomon codes has become a prevalent 
methodology for performing error detection and correction in 
digital communications, processing and storage applications, in a 
Reed-Solomon code, sequential m-bit data symbols forming a data 
block are treated as being representative of coefficients of a 
polynomial in a variable, e.g., »x". In particular, a sequence of 

k m-bit data "message" symbols <«,, m„ m, > are treated 

as a "message polynomial," m(x) , of degree k-1, where 

m(x) = mox*" 1 + m,x k " 2 + . . . + m^x + m^, 
An encoder divides the message polynomial, m(x), by a selected 
"generator polynomial," g(x), to produce a "remainder polynomial," 
r(x), having coefficients in the form of r m-bit error check 
symbols. 
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A Reed-Solomon generator polynomial, g(x), has the general 

form: 

where a ("alpha") is a primitive element of a Galois field, GF(q), 
whose q-l powers, {a 0 ,** 1 ,^,. . .,a«- 2 }, exhaust the non-zero elements 
of the field and where J 0 is some arbitrary logarithm base a of the 
first root of the generator polynomial. Usually, in binary 
systems, q is a power of 2, i.e., q = 2 m , for some m. For example, 
where r = 4, — i.e., where four redundant error check symbols are 
to be generated — , the generator polynomial may be expressed in 
the factored form: 

g(x) = (x - a'°)(x - a' 0+, )(x - a J0 + 2 )(x - a ,0 + 3 ) . 
As can be seen, the nature of the generator polynomial, g(x) , 
determines, among other things, the extent and complexity of the 
error correction code. In particular, the degree of the generator 
polynomial determines the number (r) of error check symbols 
contained in the remainder polynomial, r(x) . However, the greater 
the degree of the polynomial that is selected for the generator 
polynomial, the more complex the associated encoder circuitry must 
be to perform the necessary computations. 

Generally, the error check symbols are appended to the message 
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symbols to form a Reed-Solomon "code word" of length n - k + r m- 
bit symbols, where the message and error check symbols are elements 
in the finite Galois field GF(2-). This is referred to as an »n- 
symbol codeword, » consisting of "m-bit symbols," or as an example 
of an «m-bit Reed-Solomon implementation." By definition, each 
code word, when considered as a polynomial, c(x), of degree n-1, 
where 

c(x) = c 0 x-' + c.x* 2 + . . . + c^x + c.,, 
is evenly divisible by the generator polynomial, g(x). in other 
words, c(x) mod g(x) = o. Since a correctly received code word, 
C(x), will also be evenly divisible by g(x), one well known 
procedure for detecting errors upon receipt or retrieval of a code 
word, c'(x), is to divide it by g(x) . if c (x) mod g(x) is zero, 
then c'(x) is presumed to have been correctly received. If the 
remainder of c'(x)/g(x) is non-zero, then one or more errors have 
occurred and an error correction routine is invoked by calculating 
r m-bit "error syndromes," s, . . . s,.„ respectively, (i.e., 
"symptoms of error"), from the non-zero remainder of C (x)/g(x) . 

Several procedures are known for computing the error locations 
and proper values from the calculated error syndromes. These 
techniques involve error- locator-polynomial determination, root 
finding for determining the positions of the errors, and error 
value determination for determining the correct bit-pattern of the 
errors, in many error detection and correction architectures, the 
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block of data being evaluated is temporarily stored in a buffer 
memory, while the error correction procedure is performed with the 
error syndromes generated from the received, or retrieved, code 
word. Known architectures may perform the error correction 
routines by using hardware circuitry under the control of a 
programmed state machine or arithmetic logic unit, by using a 
microprocessor under firmware control, or by some combination of 
both. For example, several known architectures perform the 
correction of a very small number of errors, typically no more than 
one or two, by hardware circuitry "on-the-f ly , « i.e., without 
stopping or substantially slowing the bit-rate of the data blocks 
during a typical transfer, or retrieval, of multiple blocks of 
data. The correction of more than these one or two errors, if 
enabled by sufficient error syndrome information, is typically 
given to a system level microprocessor to carry out the error 
correction operations on an as-needed, or "interrupt" basis, i.e., 
where the data flow is temporarily slowed, or stopped. When this 
happens, the microprocessor is fed the error correction syndrome 
information for a data block having the more than one or two errors 
and calculates values for locating and correcting all of the errors 
under direction of error correction firmware. In the event the 
data is not correctable, i.e., where there are more errors detected 
in the block than can be corrected by the hardware and/or firmware 
error correction architecture, an error recovery procedure may be 
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attempted, e.g., which may involve one or more retries to recover 
data from the storage media. 

One general limitation of Reed-Solomon codes is that the 
maximum number of m-bit data symbols that can be corrected within 
an m-bit code word is limited to the integer portion, or "floor," 
of the number of appended error check symbols (r) divided by two, 
i.e., INT(r/2) . Another general limitation is that each ra-bit code 
word is limited to a maximum length of 2° - l symbols, sometimes 
referred to as its "natural block length," including both the 
source data symbols and the appended error check symbols. Thus, 
implementations of Reed-Solomon error correction techniques must 
take into account these limitations. 

For example, commonly owned U.S. Patent No. 5,241,546, issued 
to Peterson et al., discloses a system architecture which employs 
a Reed-Solomon code to detect and correct data errors in a disk 
drive data storage system which handles data in blocks, or 
"sectors," containing 512 8-bit symbols (or "bytes"), d[l], d[2], 
• • ., d[512], respectively. Because the total number of 8-bit 
symbols that can be protected in a single 8-bit code word is 2 8 - 
1 = 255 symbols, each 512 byte data sector is divided into smaller 
blocks for purposes of performing error correction. In so doing, 
the Peterson et al. Patent also takes into account that data errors 
encountered in certain digital data communication and storage 
systems, such as in magnetic based disk drive systems, are often 
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"bursty" in nature, i.e., the data errors tend to appear across 
consecutive bit positions within a block of data, as opposed to 
occurring randomly. The 512 data bytes are therefore serially 
divided into three separate "interleaves," each interleave 
5 containing every third data byte, — i.e., with interleave 1 
containing data bytes d[l), d[4], d[7], . . d[5li], interleave 
2 containing data bytes d[2], d[5], d[8], . . d[5i2], and 

interleave 3 containing data bytes d[3], d[6], d[9], . . . , 
d[510], respectively. Each interleave is then separately encoded 

10 to form a corresponding 8-bit Reed-Solomon code word. 

By distributing successive data bytes into separate code words 
for purposes of error correction, the 3-interleave architecture 
offers the advantage of, for example, treating the occurrence of an 
"error burst" of three successive corrupted data bytes as, in 

15 effect, a single byte error occurring in each code word. In other 
words, because of the substantial likelihood that data errors, if 
any, will occur in consecutive data symbols, the 3-interleave 
architecture increases the probability that encountered symbol 
errors will be distributed evenly into separate code words, thereby 

20 facilitating correction of a greater number of total symbol errors 
per data sector, without requiring additional error correction 
capability for a given code word. 

The advantage of an interleave architecture is lost, however, 
if the communication, data processing, and/or storage system 
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application is not prone to bursty error patterns. For example, in 
a digital data system employing a semiconductor based memory in 
which error patterns occur in a substantially random manner, 
implementation of an interleave architecture may actually result 
in a substantial portion of the allocated error check redundancy 
being ineffective, since encountered errors are much less likely to 
be evenly distributed across the interleaves. Moreover, if the 
number of errors occurring in a single interleave exceed the error 
correction capability of that code word, the entire data sector may 
be lost, even if the total number of errors occurring sector -wide 
would otherwise have been within the allotted error correction 
capability had they been distributed evenly across all interleaves. 

Thus, in a non-bursty applications it would be desirable to 
fully utilize all allocated error check redundancy to perform error 
correction for an entire data block or sector, e.g., by using of a 
single Reed-Solomon code word, regardless of the particular symbol 
size format of the data block. 
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Summary of the Invention 
The present invention provides a Reed-Solomon error correction 
methodology and architecture which is suited for digital data 
communication, processing and/or storage systems encountering 
substantially random error patterns, such is characteristic of a 
data storage and retrieval system employing a semiconductor based 
memory store, 

A general object of one aspect of the present invention is to 
provide a Reed-solomon error correction methodology and 
architecture ideally adapted to a mass production environment, 
which minimizes hardware components and the related production 
costs associated therewith. 

A general object of another aspect of the present invention is 
to provide system architecture for implementing a 10-bit Reed- 
Solomon code for detecting and correcting data errors in a single 
code word to protect a data block containing up to 1023 10-bit data 
symbols, i.e., the equivalent of up to 1278 8-bit symbols, 
including error check redundancy. 

A general object of yet another aspect of the present 
invention is to provide a Reed-Solomon error correction 
architecture which maximizes the use of all allocated error 
correction overhead for an entire block of data, regardless of the 
particular error pattern characteristics encountered in a given 
system application. 
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A general object of yet another aspect of the present 
invention is to provide a practical implementation of a 10-bit 
Reed-Solomon error correction code by employing 5-bit extension 
field operations over a Galois field (2 5 ) to perform certain, 
requisite arithmetic functions with a hardware-minimized error 
correction architecture. 

A more specific object of the hardware-minimized 
implementation aspect of the present invention is to provide a 10- 
bit Reed-Solomon error correction methodology and architecture 
which employs 5-bit extension field operations utilizing a Galois 
field GF(2 5 ) generated from the generator polynomial g(x) = x 5 + x 2 
+ 1, over GF(2) . 

Yet another more specific object of the hardware-minimized 
implementation aspect of the present invention is to provide a 10- 
bit Reed-Solomon error correction methodology and architecture 
which generates the Galois field GF(2'°) using the irreducible 
polynomial P I024 (y) = y 2 + y + l, over a Galois field GF(2 5 ) generated 
from the generator polynomial g(x) = x 3 + x 2 + 1, over GF(2). 

In accordance with these and other objects, features and 
aspects of the present invention, a data block of m-bit data 
symbols is formatted into a sequence of k 10-bit data symbols and 
passed through an encoder which constructs a preselected number r 
10-bit error check symbols, where k + r < 1023. The error check 
symbols are converted back into an m-bit symbol format and appended 
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to the original m-bit data block for transport or storage. Upon 
being received, or later retrieved from storage, the m-bit data 
block, including the appended error check symbols, is again 
formatted as a sequence of 10-bit symbols and passed through a 
decoder/ syndrome generator, which generates r 10-bit error 
syndromes. if any of the error syndromes are non-zero, thereby 
indicating the existence of one or more 10-bit symbol errors, the 
error syndromes are fed into a error correction module, which, 
using a Galois field arithmetic logic unit ("GF-ALU" ) under the 
control of a programmed state machine, conducts 10-bit Reed-Solomon 
error correction operations (i.e., over a 10-bit Galois field 
GF(2 10 )), preferably to determination the location and correct 
values of up to two 10-bit symbol errors by hardware solution and 
up to four 10-bit symbol errors under firmware control. 

According to one aspect of the present invention, the GF-ALU 
performs certain arithmetic functions, preferably including at 
least the 10-bit multiplication and inversion functions, 
respectively, by 5-bit extension field operations over a Galois 
"sub field" GF(2 5 ) , generated by using the irreducible polynomial: 

P 3 2(x) = x 5 + x 2 + 1, over GF(2), 
with each 5-bit nibble represented in a standard basis as a fourth 
degree polynomial of "x"; i.e., where a 5-bit nibble {d 4/ d 3 ,d 2 ,d |/ d 0 } 
is represented as coefficients in the polynomial: 

d 4 x 4 + d 3 x 3 + d 2 x 2 + d,x +d 0 . 
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Addition of the 5-bit nibbles is preferably performed through 
a bit-wise XOR operation. Multiplication is preferably performed 
as Standard polynomial multiplication, reduced modulo P„(x) . 

According to another aspect of the present invention, the 
field GF(2'») is generated using the irreducible polynomial: 

PioM(y) = y 2 + y + 1, over GF(2 S ) , 
with the 10 bit symbols represented in a standard basis as a sum of 
the powers of »y» over GF(2>), i.e., with each 10-bit symbol treated 
as two 5-bit nibbles (e„ e 0 ) , which represents e,y + e 0 , to perform 
the requisite arithmetic functions. Addition of the 10-bit symbols 
is preferably performed through a bit-wise XOR operation. 
Multiplication is preferably performed as standard polynomial 
multiplication, reduced by modulo P 1024 (y). ln one preferred 
embodiment, the element a = (ioi)h (i.e., {0100000001} binary), 
serves as a primitive element of GF(2'°). 

In this manner, 10-bit arithmetic operations are accomplished 
by concatenating the result of 5-bit extension field operations 
performed on the most significant and least significant 5-bit 
nibbles, respectively, of the respective 10-bit symbols. 

These and other objects, aspects, advantages and features of 
the present invention will be more fully understood and appreciated 
by those skilled in the art upon consideration of the following 
detailed description of a preferred embodiment, presented in 
conjunction with the accompanying drawings. 



- 12 



WO 97/00559 



PCT/US96/10074 



Brief Description of the Drawings 

It is to be understood that the accompanying drawings are 
provided for the purpose of illustration only, and are not intended 
as a definition of the limits of the invention. The drawings 
illustrate both the design and utility of a preferred embodiment of 
the present invention, in which; 

FIG. 1 is a functional block diagram of the host-to-memory 
(write) circuit of a semiconductor based data storage and retrieval 
system embodying aspects of the present invention; 

FIG. 2 is a functional block diagram of the memory-to-host 
(read) circuit of the semiconductor based data storage and 
retrieval system shown in FIG. 1; 

FIG. 3 is a flow chart depicting a preferred error correction 
process ; 

FIG. 4 is a block diagram illustrating the functional system 
architecture of a preferred error correction module; and 

FIG. 5 is a block diagram illustrating the functional system 
architecture of a preferred Galois field arithmetic logic unit 
within the error correction module of FIG. 4. 

Detailed Descri ption of the Preferred Embodiment 
Referring to FIG. 1, a host data processing system 10 
packetizes a block of digital data for storage in an adjunct data 
storage and retrieval system 12. While particular blocks of data 
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being transported by the host system io to the storage system 12 
nay vary in size and format with differing applications and 
operating system architectures, for purposes of describing the 
illustrated preferred embodiment, a commonly used data block size 
is selected, which contains 256 16-bit data symbols for a total of 
4096 bits. A host interface circuit 14, such as, e.g., a SCSI or 
IDE interface device, receives the data block from the host system 
10. The interface circuit 14 then transmits the data block through 
a 16:10 bit wide conversion circuit 16, which serially reformats 
the 256 16-bit data symbols into 410 10-bit data symbols, i.e., a 
total of 4100 bits, with the final four bits of the 410th data 
symbol comprising added zeros. The data block is also transmitted 
by the host interface circuit 14 directly into to a holding buffer 
18, without reformatting. 

In alternate embodiments, the data block 12 may be packetized 
in varying bit formats by the host system 10 and/or host interface 
circuit 14. For example, the same size data block (i.e., 4096 
bits) may be packetized in a "byte-size" format as 512 8-bit data 
symbols, or "bytes," as well. m this later case, the bit-wide 
conversion circuit 16 would be designed to reformat the 512 8-bit 
data symbols into the 410 lo-bit data symbols, i.e.. at an 8:10 
ratio instead of a 16:10 ratio. Alternatively, the data block may 
be originally packetized by the host system 10 in a 10-bit symbol 
format, which would eliminate the need for the bit-wide conversion 
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circuit 16 within the data storage and handling system 12. 

Whatever the original symbol bit-format of the data block, 
once converted into 10-bit format, the individual data symbols are 
transmitted through a 10-bit Reed-Solomon encoder circuit 20, which 
generates a preferred number r of 10-bit error check symbols based 
on a preferred generator polynomial, g(x), where r is equal to the 
polynomial degree of g(x) . For purposes of describing the 
illustrated preferred embodiment, wherein correction of up to two 
10-bit data errors by direct hardware solution and up to four 10- 
bit errors by firmware solution is preferred, an exemplary 
generator polynomial g(x) of degree r = 8 has been selected, where 
(in factored form) : 

g(x) = (x + a 0 ) (x + a'Hx + a 7 ) (x + a 3 ) (x + a 4 )* 

*(x + a 5 ) (x + a 6 ) (x + a 7 ) . 
While any number of known Reed-Solomon encoder circuits may be 
utilized to generate the error check symbols, e.g., such as an 
eight stage linear shift register with feedback, a combined 
encoder /syndrome generator circuit 20 of the type disclosed in an 
article entitled "A Combined Reed-Solomon Encoder and Syndrome 
Generator with Small Hardware Complexity," by G. Fettweis and M. 
Hassner, p 1871-74, IEEE, 0-7B03-0593-0/92 (1992), is preferably 
employed. An embedded system microprocessor 22 directs the 
combined encoder/syndrome generator circuit 20 to switch to "encode 
mode," wherein the encoder circuit 20 divides the 410 10-bit data 
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symbols of the data block by g(x) to produce a "remainder" 
comprising 8 10-bit error check symbols. 

The 8 error check symbols are transmitted through a 10:16 bit 
wide conversion circuit 24, which serially reformats the 8 10-bit 
error check symbols into 5 16-bit symbols. The 5 16-bit symbols 
are then appended to the 256 16-bit data symbols of the original 
data block in buffer 18, i.e., which were transmitted directly from 
the host interface circuit 14, the 261 16-bit symbols forming a 
code word. The code word is released from buffer 18 by the system 
microprocessor 22 and is stored in a semiconductor based memory 
store 26, such as, e.g., a "FLASH" ram. In alternate 
configurations and embodiments, the data block may be transmitted 
directly from the host interface circuit 14 into the memory store 
26, i.e., without employing a holding buffer, with the reformatted 
(16-bit) error check symbols appended thereto directly in the 
memory store 26. 

Referring to FIG. 2, the 261 16-bit code word is retrieved 
from the semiconductor memory store 26 and transmitted into a 
holding buffer 28, which may be the same buffer as buffer 18. The 
code word is also transmitted through a 16:10 bit-wide conversion 
circuit 30. Preferably, holding buffer 28 retains .only the first 
256 symbols of the code word, i.e., only the symbols comprising the 
original source data and not the 5 error check symbols, while all 
261 symbols are passed through the 16:10 bit-wide conversion 
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circuit 30, The bit-wide conversion circuit 3 0 reformats the 261 
* 16-bit code word into a 418 * 10-bit codeword, i.e., the 410 
10-bit source data symbols, including the added four zeros in the 
410th symbol, plus the 8 10-bit error check symbols. 

The 418 10-bit symbol code word is passed through the combined 
encoder/syndrome generator 20, which, having been switched to 
"syndrome" mode by the system microprocessor 22, generates eight 
10-bit error syndromes, S 0 . . . S 7 , respectively, therefrom. Other 
syndrome generation circuitry may be alternatively be employed, 
such as conventional DFT circuits, or the like. The error 
syndromes are evaluated in the encoder/ syndrome generator 2 0 and, 
if all syndromes are determined to be zero, (i.e., 80 consecutive 
zero bits) , the retrieved data block is presumed to be error free 
and the system microprocessor 22 signals the holding buffer 28 to 
release the 256 16-bit data symbols to the host interface circuit 
14. The host interface circuit 14 then transmits the data block to 
the host system 10 for use in its intended application. 

If, however, one or more of the 8 10-bit error syndromes is 
non-zero, the syndromes are supplied to an error correction module 
36 for determination of the location and correct values of the 
symbol (s) in error. As depicted in the flow chart in FIG. 3, the 
error correction module 36 executes the "direct solution," or 
"Peterson Gorenstein Zierler" algorithm, for finding the 
location (s) and value (s) of up to two 10-bit symbol errors using a 
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hardware solution. If the existence of more than two errors is 
detected during the error correction routine, the error syndromes 
are transferred to the system microprocessor 22, which, under 
firmware control, can determine the correct locations and values of 
up to four 10-bit symbol errors. 

Referring to FIG. 4, the error correction module 36 operates 
under the control of a programmed state controller 42, which 
directs a specialized 10-bit Galois field arithmetic logic unit 44 
("GF-ALU") , to perform the requisite addition, multiplication, 
inversion, squaring and table look-up operations necessary to 
execute the error correction algorithm depicted in FIG. 3. Eight 
registers, R,, - r 7 , 46a-h, respectively, are provided for holding 
interim results, with register load enables 48 (not shown in 
detail) provided to control which register is loaded with the 
result of the operation from the particular preceding instruction 
of the state controller 42. In the illustrated preferred 
embodiment, register R„, 46a, is a dedicated source operand for 
addition operations and register R, , 46b, is a dedicated source 
operand for multiplication operations, respectively. Registers R 4 , 
46e, and R,, 46f , respectively, are used for storing the calculated 
error location (or "locator") values, X 0 and (if double error) X,, 
respectively. Registers R,, 46g, and R 7 , 46h, respectively, are 
used for storing the calculated error values, Y 0 and (if double 
error) Y, , respectively. It may be possible to use a different 
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register configuration in alternate embodiments, depending on the 
order of steps performed. 

* The following tables list the preferred state instructions 
given by the state controller 42 to the GF-ALU 44. Table 1 lists 
the state instructions for the "Double Error Decoding" process, 
Table 2 includes the state instructions for "Double Error Checking" 
process, and Table 3 includes the state instructions for "Single 
Error Decoding and Checking" process, respectively. In the state 

tables, "st " identifies each given state; "src" identifies the 

"source" register or error syndrome, respectively, which contains 
or comprises the input data to be used in the specified operation. 
"Inst" refers to the particular instruction, or mathematical 
operation to be performed by the GF-ALU 44, where "load" calls for 
loading the input value into a specified register, "mult" calls for 
multiplying the input value with the contents of register R l# "sqre" 
calls for squaring the input value, "accm" calls for adding the 
input value to the contents of register Rq, and "noop" calls for a 
branch to another state if a specified zero or non-zero value is 
determined. The "dec" entry refers to the specific register 
location, if any, in which the resulting value of a given state 
operation is to be stored. The new contents, if any, of the 
registers following each state operation are also included, as is 
a brief comment to explain, if necessary, the operation that was 
performed. 
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In the preferred state instructions set forth in Tables 1-3, 
respectively, there are a total of 106 possible instructions (or 
"states") to be carried out during execution of the error 
correction algorithm The state controller 42 will continue serial 
execution of instructions, until either a double or single error 
solution is completed, or until both fail, with the actual number 
of states depending upon which event occurs. If either the double 
or single error decoding process is successfully completed, both 
the location(s) of the lo-bit symbol error(s), x 0 and (if double 
error) X„ respectively, and the correct 10-bit data value(s) for 
substitution therefor, V 0 and (if double error) Y„ respectively, 
are obtained from the respective registers - R „ 46e - n , by the 
system microprocessor 22. in the event both double and single 
solutions fail, the existence of more than two errors is presumed 
and the eight error syndromes, S 0 - s„ respectively, are 
transferred to the system microprocessor 22, which preferably can 
calculate the locations and correct values of up to four 10-bit 
symbol errors under firmware control. 

Whether calculated by the gf-ALU 44, or by the system 
microprocessor 22, the corrected data location(s) and value(s), 
respectively, are substituted for the erroneous data value(s) by 
the system microprocessor 22, while the retrieved data block is 
still retained in the holding buffer 28. Because the location^, 
and correct value (s) , respectively, are calculated based on the 10- 
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0 



bit a^ooi IO x»t. th. S y 5 t„ .ioroprooasaor 22 transiat.s t». io- 
bit location,,, ,„„ value(s) ln£0 the corc . spoMi „ g ls . Mt 
nation,., „ d value(s) , befor> substitution ^ MTCk 
Th. oat, bl oc k is than reUasea tro « the holdlng buffer ^ ^ ^ 
boat i„te«ao. oircuit X a„o host syataa l0 . „spa=tiva ly . for use 
in its intended application. 

Re ferring to FIG. 5, the state controller 4 2 serially invokes 
each new GF-AU, instruction 45 to the CP-^u 44 . A 10 . bit ^ 
contained either in one of registers , „ . . . R? , 4fia . h> 
respectively, or comprising a particular error syndrom, s 0 . 
S„ respectively, is used as the input v&lue ^ ^ 
indicated in the GF-ALU instruction The ^ 

ar.th.etic operations, i.e. over a Galois field GF( 2 .»> or 
-«(1024, f . include addition 52 , which is preferably ca „ ied ^ fay 
X'OR operation, multiplication 54, inversion 56, and squaring 58 
respectively, as indicated in the particular state instruction. A 
y J + y + C solution table look-up function 60 is also provided in 
the gf-alu 44 , for finding the guadrat . c solut . on(s) ^ ^ Gaio . s 

process . 



After each instruction, the GF-ALU output 49 is loaded into 
one of registers, R, . . . R „ 46a . h< respectively/ which ^ 

by a destination address 4 7 sent by the register load enables 48 
The GP-ALU output 49 is occasionally checked for a zero or non-zero 
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value by a zero detect circuit 62, in order to verify whether a 
particular single or double error solution attempt has succeeded or 
failed, respectively. if either a zero or non-zero value is 
detected, depending upon the particular state, a status bit or 
"flag" 66 is sent to the state controller 42, which will set the 
appropriate status, e.g. "decoder fails," or "successful 
double/single error decoding," to alert the system microprocessor 
22 to obtain the requisite information from either the syndrome 
generator (if "decoder fails") or the appropriate registers R, - r 7> 
46e-h, respectively, m certain state operations, the failure to 
locate a viable solution in the look-up table 60 will also trigger 
a status flag 66 to be sent. 

In accordance with one aspect of the present invention, the 
operations of the GF-ALU 44 are preferably substantially hardware- 
minimized by employing 5-bit extension field operations over a 
Galois field GF<2>), or GF(32>, to assist in performing certain of 
the 10-bit arithmetic operations. in the illustrated preferred 
embodiment, the 5-bit extension field operation units include 5-bit 
multipliers 55, and inverters 57, respectively. 

in the 5-bit operations, each 5-bit nibble or "number" is 
treated in a standard basis as a fourth degree polynomial, i.e., 
where each 5-bit number, {d^^d.,^} , is represented as 
coefficients in the fourth degree polynomial, d 4 x< + d,x> + dj x> + d,x 
+d 0 x°. The Galois field GF(2>), or GF(32) , which is defined by an 
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arbitrarily selected first element and a fifth degree generator 
polynomial, designated herein as "p(x),« is preferably generated 
based on the irreducible polynomial x 3 + x' + l. its first element 
a' (i.e., excluding o°) , is selected to be equal to x, where x 
5 represents the binary field element in polynomial representation 
over GF(2) , as is known in the art. The 31 possible non-zero field 
elements are represented by the successive powers of alpha, 
including a°. Accordingly, each successive element in the field may 
be determined by multiplying the preceding field element by a', mod 
10 p( X ) . Put another way, the non-zero field elements are represented 
by the antilogs of the powers of a from 0 to 31, where each of the 
antilog values are calculated mod p(x), so that no antilog value 
can exceed 31, with a 31 mapping back to o°, (i. e. , a 3 ' = a 0 = i) . For 
example: 

= x 1 mod p(x) = 2' = (binary) 00010; 
= a'-o 1 mod p(x) « x' x 1 mod p(x) « x 7 = 00100; 
■ a J -a' mod (p(x) = 01000; 
= a 3 -a 1 mod (p(x) = 10000; 

= Q 4 a« = x 4 x' mod p(x) = x 5 mod (x 5 + x 2 +1) = 

x J + 1 = 00101 (etc. ) . 

After the GF(32) antilog table is calculated, generating a log 
table ("base a") is simply a matter of reversing the antilog table 
by mapping each power of alpha with its corresponding antilog 
value. Hardware multipliers, inverters and adders for GF(32) with 
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a 1 = x and p(x) = x 5 + x 3 + 1 over GF(2), are maintained within the 
GF-ALU 44 to facilitate the 5-bit extension field operations. To 
facilitate further explanation of the 5-bit extension field 
operations, the antilog and log tables (decimal) for GF(2 5 ) are set 
5 forth in Tables 4 and 5: 

TABLE 4. Antilog Table/Elements of GF(2 5 ). 
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TABLE 5. Log Table for GF(2 $ ) 
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Preferably, the system microprocessor 22 is also provided with 
the appropriate Galois field logs and antilogs to facilitate 
calculations for the three and four error correction routines. 

Once the results for GF(2 5 ) are determined, the 10-bit 
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multiplication and inversion operations may be easily performed by 
5-bit "extension field" operations. In particular, according to 
another aspect of the present invention, the 10 bit numbers are 
represented in a standard basis as a sum of powers of "y" over 
5 GF(2 5 ), with each 10-bit symbol treated as two 5-bit nibbles or 
"numbers," (e 0/ e,) , which represent e,y + e 0 , to perform the 
requisite multiplication and inversion functions. Preferably, the 
multiplication of the 10-bit numbers is performed as standard 
polynomial multiplication, reduced by mod P l02 4<y) , where P(y) is the 
10 irreducible polynomial: P 10W (y) = y 2 + y + l, over GF(2 5 ). 

By way of example, let 1^ = 0101110100 and m, = 1001101111, A 
10-bit multiply of * m, is required. According to this aspect of 
the invention, it is carried out as follows: 
mo = {01011}y + {10100}; 
15 m, = {10011}y + {01111}. 

Letting {01011} = a (= (ll)dec) ; 

{10100} = b (= (20)dec) ; 
{10011} = c (= (19)dec); and 
{01111} = d (= (15)dec) ; 
20 mo * m t = (ay + b) (cy + d) = acy 2 + (be + ad)y + bd. 

(acy 2 + (be + ad)y + bd) mod (y 2 + y + 1) = 

(ac + be + ad)y + (bd + ac) . 
Each of the two 5-bit products, (i.e., ac, be, ad, bd) , may be 
determined by referring to the previously generated antilog and log 
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(base a) tables, as follows: 

ac = antilog a [ (log a a + log ft c) mod (31)] 

« antilog[(27 + 17) mod (31)] 
= antilog[l3] = 28. 
5 be = antilog a [ (log tt b + log a c) mod (31)] 

= antilog[(7 + 17) mod (31)] 
= antilog[24] = 30. 
bd = antilog 0 [ (log a b + log a d) mod (31)] 

« antilog[(7 + 23) mod (31)] 
10 = antilog[30] » 18. 

ad = antilog a [ (log tt a + log a d) mod (31)] 

= antilog[(27 + 23) mod (31)] 
=* antilog[19] = 6. 
Addition of the five bit products is performed by standard XOR 
15 operation: 

(ac + be + ad) = {11100} + {11110} + {00110} = {00100}; and 
(bd + ac) « {10010} + {11100} = {OHIO}. 
Concatenating the two results, Og * o, » {0010001110} . 

To ensure full disclosure of the aforedescribed preferred 
20 embodiment, a corresponding verilog listing of the error correction 
module 36 is provided as follows: 
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module decoder! 



constclk, 


/* positive edge byte clock 


*/ 




decstart, 


/* start of decoding 


*/ 




resetn, 


/★ decoder negative reset 


*/ 




syrsOOOO, 


/+ syndrome 0 




*/ 


syrsOOOl, 


/* syndrome 1 




*/ 


syrs0002, 


/* syndrome 2 




*/ 


syrs0003. 


/* syndrome 3 




*/ 


syrs0004, 


/* syndrome 4 




*/ 


syrsOOOS, 


/* syndrome 5 




*/ 


syrs0006, 


/* syndrome 6 




*/ 


syrs0007. 


/+ syndrome 7 




*/ 


locaterO, 


h locater of first error 




*/ 


locaterl, 


/* locater of 2nd error 




*/ 


errvaluO, 


/* value of first error 


+/ 




errvalul, 


/+ value of 2nd error 


*/ 




decrstat. 


/+ decoder status flags 




+/ 


finished 


/* status indicating finished 


*/ 





); 



input constclk; 
input decstart; 
input resetn; 

input [9:0] syrsOOOO; 
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input [9:0] syrsOOOl; 

input (9:0) syrs0002; 

input ' (9:0) syrs0003; 

input 19:0] syrs0004; 

input (9:0] syrsOOOS; 

input [9:0] syrs0006; 

input [9:0] syrs0007; 

output [9:01 locaterO; 

output [9:0] locaterl; 

output [9:0] errvaluO; 

ouput [9:0] errvalul; 

ouput (9:0] decrstat; 

output 11:01 finished; 



// synopsys state.vector st8 

parameter [6:0] /+ synopsys enum bus.states */ 

st.00 = 7'h00, st^OI - 7'h01, st_02 = 7'h02, st_03 - 7'h03, 
stj)4 = 7'h04, s^05 m 7'h05, st.06 = 7'h06, st_07 = 7'h07, 
st_08 = 7'h08, st.09 « 7'h09, st.Oa = 7'h0a, st.Ob = 7'hOb, 
st.Oc « 7'hOc, st.Od = 7'hOd, st.Oe = 7'hOe, st.Of « 7'hOf, 
st.10 - 7'h10, st.11 - 7'h11, st_12 *= 7'h12, st.13 « 7'h13, 
stJ4 = 7'h14, st_15 = 7'h15, st_16 « 7'h16, stJ7 = 7*h17, 
st.18 = 7'h18, stJ9 = 7'h19, sUa = 7'h1a, stjb « 7'hlb, 
stjc = 7'hlc, stjd = 7'hld, stje = 7'h1e, st_1f = 7'h1f, 
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5 



10 



20 



st_20 




7'h20, 


st_21 


t= 


7'h21, st.22 




• 7'h22, st_23 


= 7'h23, 


st„24 




7'h24, 


st_25 




« 7'h25, 


st.26 « 7'h26, : 


st.27 = 7 


st.28 




7'h28, 


st_29 




7'h29, st.2a 




7'h2a, s^2b 


= 7'h2b, 


st_2c 


- 


7'h2c, 


st_2d 


- 


7'h2d, st.2e 


rs 


7'h2e, st.2f ■ 


= 7'h2f, 


st_30 




7'h30. 


st„31 




7'h31, st32 


- 


7'h32, st.33, 


= 7'h33, 


st.34 


= 


7'h34, 


st_35 


= 


7'h35, st_36 


= 


7'h36, s^37. 


= 7'h37, 


st.38 


- 


7'h38, 


stJ39 




7'h39, st.3a 




7'h3a, st3b, 


= 7'h3b, 


stJ3c 


= 


7'h3c, 


stJ3d 


— 


7'h3d, st_3e 


= 


7'h3e, st_3f = 7'h3f, 


st_40 




7'h40, 


st.41 




7'h41, st.42 




7'h42, st43 


= 7'h43, 


st.44 


— 


7'h44, 


st.45 


= 


7'h45, st.46 


= 


7'h46, st.47 


= 7'h47, 


st.48 


= 


7'h48, 


st.49 




7'h49, st.4a 




7'h4z, st_4b 


= 7'h4b, 


st_4c 


ss 


7'h4c, 


st_4d 


- 


7'h4d, st4e 




7'h4e, st.4f « 7'h4f, 


st.50 


ss 


7'h50 f 


= 


7'h51, s<_52 




7'h52. st_53 


= 7'h53, 


st.54 


= 


7'h54, 


st_55 


= 


7'h55, st_56 




7'h56, st.57 


- 7'h57, 


st.58 




7'h58, 


st.59 


= 


7'h59, st.Sa 




7'h5a, st.5b 


= 7'h5b, 


st.5c 


- 


7'h5c, 


st_5d 


= 


7'h5d, st_5e 


= 


7'h5e, st.5f = 


7'h5f, 


st.60 




7'h60, 


st_61 




7'h61, st.62 




7'h62, st63 


= 7'h63 f 


st.64 


— 


7'h64, 


st.65 


_ 


7'h65, st66 




7'h66, 67 


= 7'h67, 


st_68 




7'h68, 


st.69 




7'h69, st.6a 




7'h6a, st„6b 


* 7'h6b, 


st_6c 




7'h6c, 


st_6d 




7'h6d, st.6e 




7'h6e> st.6f = 


7'h6f, 


st.70 




7'h70, 


st_71 


1= 


7'h71, st.72 




7'h72, st.73 


- 7'h73, 


st.74 


= 


7'h74, 


st.75 




7'h75, st.76 




7'h76, st.77 


= 7'h77, 


st_78 




7'h78, 


st.79 


= 


7'h79. s^7a 


= 


7'h7a, st„7b . 


= 7'h7b, 


st_7c 


= 


7'h7c. 


st.7d 




7'h7d, st_7e i 


a 


7'h7e, st.7f = 


7'h7f; 
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reg [6:0] /*synopsys enum bus.states +/ st8, nxst8; 





parameter [3:01 










i_r0 « 


4'dO, 


Lrl = 4'dl, \j2 = 4'd2, i_r3 » 


4'd3, 




\j4 «= 


4'd4, 


Lr5 = 4'd5, U6 = 4'd6, i_r7 = 


4'd7, 


5 


i.sO = 


4'h8, 


Lsl = 4'h9, i_s2 = 4'ha, i_s3 « 


4'hb, 




Ls4 - 


4'hc, 


Ls5 = 4'hd, i.s6 = 4'h3, i.s7 = 


4'hf; 




parameter [2:0] 










load = 


3'dO, 


accm = 3'dl, mult - 3'd2, Invt 


= 3'd3, 




dbts = 


3'd4, 


sqre = 3'd5, stat = 3'd6, noop 


= 3'd7, 


10 


o.rO « 


3'dO, 


o.rl = 3'd1, o.r2 = 3 r d2, o_r3 


= 3'd3, 




o_r4 = 


3'd4, 


o.r5 = 3'd5, o.r6 = 3'd6, oj7 


= 3'd7; 



reg [10:0] fnk; 
wire (1:01 decrstat; 

wire [3:01 sourcadd; 
15 wire [2:0] instruct; 

wire [2:0] destnadd; 

wire finished; 

wire 02; 

wire nz; 
20 wire instatus; 
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assign bz = instatus; 
assign nz = -bz; 

assign sourcadd = fnk 110:7); 
assign instruct - fnk f6;4); 
assign destnadd = fnk f 3: 11; 
assign finished = fnk [9); 



always @{st8 or bz or nz> 
case <st8) // synopsys full.case parailel_case 



st.OOrbegin nxst8=st_01; fnk={LsO.»oad,o_r1 ( 1'dO}; end // SO into R1 multiplier 

st.01 :begin nxstB . st_02; fnk = {t_s2,mult.o.„r0, 1 *d0}; end // S0S2 into RO accumulator 

st_02:begrn nxst8 = st_03; fnk-{i.s1 0 sqre,o_r2,VdO}; end // SU2 into R2 

st_03:begin nxst8=st_04; fnk = {4'h2,accm,o_r2,rdO}; end // S0S2 + S1**2 into R2 



/* R2 now contains detlM2] +/ 

/* det(M2[ is nonzero if there are two or more errors */ 

/* detlM2) is zero if there are one or more thatn 2 errors ★/ 



st_04:begin nxst8 = ({7{bzJ}&sMe)| {{7{nz}}&st 05); fnk«|4'h3,ioad,o_r3,VdO}; end // noop branch 
st 05:begin nxst8 = st_06; fnk = {i_s3,mult,o_r0.1'd0}; end // S0S3 into RO 

accumulator 

st 06:begin nxst8 = st_07; fnk = {Ls1,load,o_r1,1'dO}; end // S1 into R1 
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multiplier 

st 07:begin nxst8 = st_08; fnk = {i,s2,mult,o_r3,1'd0}; end // S1S2 

st 08:begin nxst8-st_09; mk = {4'h3.accm,o_r3,rd0); end // S1S2 + S0S3 

/* R3 is the numerator of Lambda 1 */ 
/+ R3 is zero causes the two error solution to fail */ 



st_09:begin nxst8 = |{7{bz}}&st„66)|({7{n Z }}&st.0a); fhk- (4'h3,load.o_r3, VdO}; end // noop branch 
st_Oa:begin nxst8 = st_0b; fnk«{i..s3,mult,o_r0,rd0}; end//SlS3 into RO 

accumlator 

st_0b:beginnxst8«st_0c; fnk = {Ls2.sqre,o_r4.VdO); end // S2*+2 into R4 

st_Oc:begin nxst8«st_0d; fnk = {4'h4.accm,o.r4 f VdO); end // S2**2 + S1S3 into R4 



/* R4 is the numerator of Lambda2 */ 
/+ R4 is zero causes the two error solution to fail */ 



st_Od:begin nxst8 = ({7{bz}}&st,66)K{7{nz))&st.0e); fnk = {4'n4,load,o_r4,1'd0}; end // nop branch 
st_Oe:begin nxst8 = st.0f; fnk = {4'h2,invt,o_r1,rd0}; end // inverse{detlM2I} i nt0 R1 

15 multiplie 

st_Of:begin nxst8 = st_10; fnk«{4'h3,mult,o_r3,VdO}; end // Lambdal into R3 

stJOrbegin nxst8 = st_1 1 ; fnk = {4'h4,mult,o_r4,1'd0}; end // Lambda2 into R4 



/* now we have equation of the form z**2 + Lambda2 » = 0 to solve */ 
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stJ1:begin nxst8 = stJ2; fnk- {4'h3,invt,o_r1,1'd0}; end // 1/Lambda1 



into R1 



multipiie 



stJ2:begin nxst8 = stJ3; fnk- {L$1,mult,o_r7,rd0}; end // S1/Lambdal 
stJ3:begin nxst8 = st_14; fnk = {4'M ,sqre,o_M,1'dO}; end // 1 /(Lambda 1 ★★2) 



into R1 



into R7 



5 



multipiie 



stJ4:begin nxst8«st_15; fnk-{4'h4.multo.r1. VdO); end // Lambda2 /( Lambda 1 ++2) 



into Rl 



multipiie 



/+ now we have equation of the form y**2 + y + c = =Oto solve */ 
/* where c = Lambda2/|Lambda1 **) ^ 
/* if one solution exists, we call it w 

stJ5:begin nxst8=stJ6; fnk = {4'h1,db1s,o_r1,1'd0}; end // w into R1 multiplier 
/* no such solution causes the two error solution to fail */ 

stJ6:begin nxstS = <{7bz}>&st_66) [«7{nz>}&stJ 7); fnk»{4'h1,load,o_r1,VdO); end // noop branch 
stJ7:begin nxst8=stJ8; fnk-{4'h3,mult.o.r4 f VdO); end // Lambda 1 w = XO into R4 
st.18:begin nxst8=st.19; fnk-{4'h4 f load.o.rard0}; end // XO into RO accumulator 
stJ9:beginnxst8 = stJa;fnk«{4'h3,accm,o.r5,VdO};end//Lambda1 + X.O = X1 into R5 

/* now we have the error locaters done, find error values */ 
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st.larbegin nxst8 = stjb; fnk = {LsO,mult.o_rO,1'dO}; end // SO w lnt0 R0 

st.1b:be 0 m nxst8-st_1c; fnk-{4'h7.accm, 0 _rO.VdO}; end // SO w + S1/Lambda1 = Y1 into RO 

/* a zero error value causes the two error solution to fail */ 

st.lcrbegin nxstS - <{7<bz}>&st_ 66|({7{nz}}&stjd); fnk-4mioad.o_rO,VdO>; end // noop branch 
5 st_1d:beoin nxst8 = stje; fnk = {MsO,accm,o_r6,1'dO}; end // SO + Y1 = YO int0 46 

/★ zero error value causes the two error solution to fail ♦/ 

st_1e:begin nxst8 = ({7{b 2 }}&st,66)|({7{nz}}&st.1f); fnk- { 4'h'6Joad,o.r6.rdO}; end //noop branch 
stjfibegin nxst8 - st.20; fnk^4'h0,load,o_r7.1'd0}; end // Y1 moved to standard place, R7 

/* double error solution is now complete {R4, R5, R6, R7} - {XO, XI, YO, Yl) */ 
10 /* check the solution against all syndromes: 



into R1 



st_20:begin nxst8 *=st_21; fnk = {4'h4,sqre,o_rl ,1'dO); end // X0**2 
multiplie 

st.21:begin nxst8«st_22; fnk = {4'h6,mult f o - r2,VdO}; end // X0**2 YO into R2 

st_22:begin nxst8 = st_23; fnk^'hS.sqre.o^rlJ'dO}; end // X1++2 mt0 R1 

15 multiplie 

st_23:begin nxst8 = st_24; fnk»{4'h7,mijlt,o_r3,VdO}; end // X1*+2 Yl 

st_24:begin nxst8 = st,25; fnk«{4'h3,load,o_r0,1'd0}; end // X1**2 Y1 
accumulat 



into R3 
into RO 
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st_25:begin nxst8 = st_26; fnk = {4'h2,accm.o.rO.VdO}; end // XU*2 Y1 + X0+*2 YO into RO 
accumulat 

st.26:begin nxst8=st.27; fnk = {i_s2,accm,o_r0,rd0); end // S2 + X1*+2 Y1 + X0**2 YO into RO 
/* a nonzero value causes the two error solution to fail */ 

5 st,27:begin nxst8 = ({7{nz})&st_66)|({7{bz}}&st^8); fnk = 4'hO,load,o_rO,1'dO); end // noop branch 
st_28:beain nxst8 = st_29; fnk«{4 f h4Joad ( o.r1J'dO}; end // XO into R1 

multtpiie 

st_29:begin nxst8 = st_2a; fnk = (4*h2.mult,o_r2, 1 'dO}; end // X0+*3 YO into R2 

st_2a:begin nxst8 = st_2b; fnk = {4'h5,load,o_r1 f 1'd0}; end // X1 into R1 

10 multiplie 

st_2b:begin nxst8«st.2c; fnk«{4'h3,mu1t,o_r3,1'd0}; end // XU*3 Y1 into R3 

st_2c:begin nxst8 = st_2d: fnk~{4'h3,load,o_r0,rd0); end // X1**3 Y1 into RO 

accumulat 

st_2d:begin nxst8 = st_2e; fnk = {4'h2,addm,o_r0,1'd0}; end // XU*3 Yl + X0*+3 YO into RO 
15 accumulat 

st,2e:begin nxst8 = st_2f; fnk = {i^s3,accm,o_r0,1'd0); end// S3 + X1+*3 Yl + X0+*3 YO into RO 



/* a nonzero value causes the two error solution to fail */ 



st_2f:begin nxst8 = {{7nz}}&st_66|({7{bz})&st_30); fnk=4'hO,load,o.rO,1'dO}; end // noop branch 
st.30:begin nxst8 = st.31; fnk = {4'h3,mult,o.r3,1'd0}; end // X1**4 Y1 i nto R3 

st_3l:begin nxst8 = st_32; fnk = {4'h4,load,o_rl,VdO}; end // XO into R1 
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multiplie 

sU2:begin nxst8 = st_33; fnk = {4'h2,mult,o„r2.rd0}; end // X0*+4 Y0 into R2 

st_33:begin nxst8~st34; fnk={4'h3,load,ojr0,1'd0}; end // X1*+4 Y1 into R0 

accumulat 

5 stJ34:begin nxst8=st_35; fnk = {4'h2,accrn,o_r0,1'd0}; end // X1+*4 Y1 + X0++4 YO into R0 
accumulat 

st_35:begin nxst8 = st_36; mk = {i_s4,accm,o_r0,rd0}; end // S4 + Xl**4 Yl + XO**4 YO into R0 
/* a nonzero value causes the two error solution to fail */ 

stJ36:begin nxst8 = ({7{n2}}&st.66|{{7{bz}}&st.37); fnk = {4'h0,load,o KU'dO}; end // noop branch 
10 stJ37:begin nxst8 -st_38; fnk = {4'h2,mult,o_r2,1'd0}; end // X0**5 YO into R2 

st_38:begin nxst8 = st_39; fnk = {4'h5,load.o_M,1'd0}; end // XI into R1 

multiple 

st_39:begin nxst8 = st_3a; fnk = {4'h3,mult,o_r3,1'd0}; end // X1**5 Y1 into R3 

st.3a:begin nxst8 = st.3b; fnk = {4'h3,load.o_r0,1'd0}; end // XU*5 Yl into R0 

15 accumulat 

st_3b:begin nxst8 = st.3c; fnk«{4'h2,accm,o.r0,Vd0}; end // X1+*5 Yl + X0**5 YO into R0 
accumulat 

st.3c:begin nxst8 = stjd; fnk-{i_s5,accm,ojU1'dO}; end // S5 + XU*5Y1 + X0**5 YO into R0 
/* a nonzero value causes th two error solution to fail */ 
20 st.3d:begin nxst8 = |{7nz}}&st.66|({7{bz}}&st.3e); fnk = {4'h0,load f o„r0, VdO}; end // noop branch 
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st_3e:begin nxstS - stJJf ; fnk = {4'h3,rnult.o_r3,1'd0}; end // XU*6 Yl into R3 

st_3f:begin nxst8 = st_40; fnk = {4'h4,load,o_r1,rdO}; end // XO jnt0 R1 

multiplie 

st_40:begin nxst8«st_41; fnk = {4'h2,mult,o_r2,rd0); end // X0**6 YO into R2 

st_41:begin nxst8 = st_42; fnk = {4'h3.load,o_r0, 1'dO}; end // X1+*6 Y1 j nto R0 

accumulat 

st.42:begin nxst8 = st.43; fnk-{4'h2,accm,o.r0, VdO); end // XU*6 Yl + XO**6 YO into RO 
accumulat 

st43:begin nxst8 = st_44; fnk = {i_s6,accrn,o_rO,VdO}; end // S6 + XU*6 Yl + XO*+6 YO into Ro 
/+ a nonzero value causes the tow error solution to fail */ 

st 44:begin nxst8 = <{7{nz}}&st_66)|({7{bz}}&st 45); fnk = {4'hOJoad,o_rO,VdO}; end //noop branch 
st45:begin nxst8 = st_46; fnk- {4'h2,mult,o_r2,1 'dO}; end // X0**7 YO into R2 

st_46:begin nxst8=st,47; fnk = {4'h5,load,o_rl ,1'dO}; end // XI into Rl 

multiplie 

st_47:begin nxst8«st48; fnk = {4'h3,mult,o_r3,rd0}; end // XU*7 Y1 i nto R3 

st.48:begin nxst8 = st_49; fnk«{4'h3,load,o_rO,VdO); end // XU*7 Yl into RO 

accumulat 

st.49:begin nxst8 = st_4a; fnk = {4'h2,accm,o_r0,rd0}; end //X1-*7 Y1 + XO**7 YO into RO 
accumulat 

st_4a:begin nxst8«st_4b; fnkHLs7,accrn,o_rO,1'dO}; end // S7 + XU*7 Yl + X0++7 YO into RO 
/★ a nonzero value causes the two error solution to fail +/ 
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st_4b:begin nxstS = ({7{n2}}&st.66({7{bz}}&st.4c); fnk«{4'h0,load,o,r0,1'd0}; end // noop branch 
st_4c:begin nxst8 = st_4d; fnk = {4'h2,stat,o_r2,1'd0}; end // set status register to 2 errors 
st_4d:begin nxstS ~ st_69; f nk = {4'h2,load,o_r2, 1 'dOJ; end // noop unconditional branch 

/* start single error decoding */ 

5 st_4e:begin nxst8 = st_4f; fnk = {i_s0,invt,oj1 , 1 'dO); end // 1 /SO int0 R1 

muttiplie 

/* a zero value causes the single error solution to fail */ 

st_4f :begin nxstS « <{7{bz}}&st_66l | ({7{nz}}&st J>0); fnk = <4'h1 Joad.o.M ' 1 'dO); end // noop branch 
st_50:begin nxst8«st_51; fnk = {i_s1,mult,o_r1,rd0); end //S1/S0 - XO into R1 

10 multiplie 

/* a zero value causes the single error solution to fail */ 

st.51 :begin nxstS = ({7{bz})&st 66) | ({7<nz})&st_52); fnk= {4'h1Joad,o.r1 , 1 'dO}; end // noop branch 
st.52:begin nxst8 = st_53; fnk-{Lsl,mult.o.rO,rdO}; end // S1 XO into RO 

accumulat 

15 st_53:begin nxst8-st_54; fnk«{U»2,accm,ojO,rdO}; end // S1 XO +S2 into RO 

accumulat 

/* a nonzero value causes the single error solution to fail */ 
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st_54:begin nxst8 - ({7{nz}}&st_66) | ({7{nz))&st_55); fnk « {4'h1 ,load,o_r1 , 1 'dO}; end // noop branch 
st_55:begin nxst8=st_56; fnk = {Ls2,rnult,o_rO,VdO}; end // S2 XO into RO 

accurnulat 

st_56:begin nxst8«st_57; fnk = {i.s3,accm,o_rO,VdO}; end // S2 XO + S3 into RO 

5 accurnulat 

/+ a nonzero value causes the single error solution to fail +/ 

st.57:begin nxst8 = ({7{nz}}&st.66) | ({7{bz}}&st.58); fnk » {4'hl ,load,o_r1 , V dO}; end // noop branch 
st.58:begin nxst8 = st_59; fnk = {i_s3, mult, o.rOJ'dO); end // S3 XO into RO 

accurnulat 

10 st_59:begin nxst8«st.5a; fnk = {i_s4,accm,ojO,1'dO}; end // S3 XO + S4 into RO 

accurnulat 

N a nonzero value causes the single error solution to fail */ 

st_5a:begin nxst8« ({7{nz})&st.66| ({7bz}}&st_5b); fnk = {4'h1 f load,o.r1 i rd0); end // noop branch 
st_5b:begin nxst8«st.5c; fnk«{Ls4,fnult,o.rO,rdO}; end // S4 XO into RO 

15 accurnulat 

st.5c:begin nxst8 = st_5d; fnk = {i_s5,accm,o_r0,1'd0}; end // S4 XO +S5 into RO 

accurnulat 

/+ a nonzero value causes the single error solution to fail */ 
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st_5d:begin nxst8«<{7{nz}}&st_66|({7{bz}}&st_5e); fnk»{4'h1,load,o_n,1'd0}; end // noop branch 
st_5e:begin nxst8 = st_5f; fnk = {i„s5,multo.rO,VdO}; end // S5 XO i nt0 RO 

accuniulat 

st_5f:begin nxst8 = st.60; fnk-{i.s6,accm,o.rard0); end // S5 XO + S6 into RO 

5 accumulat 



/* a nonzero value causes the single error solution to fail */ 



st_60;begin nxst8 = ({7{nz}}&st.66|({7bz}}&st.61); fnk = 4'M.load,o_rl.1'dO}; end // noop branch 
st.61:begin nxst8 = st_62; fnk = {ijs6,mult,o_r0,1 'dO}; end // S6 XO into R( 

accumulat 

st_62:begin nxst8=st_63; fnk = s7,accm,o_r0,1'd0}; end // S6 XO + S7 into R( 

accumulat 

/+ a nonzero value causes the single error solution to fail */ 



st.63:begin nxst8 = {{7{nz}}&st.66|({7{bz}}&st.64; fnk«{4'h1,!oad,o_M,1'dO); end // noop branch 
st_64:begin nxst8 = st_65; fnk«{4'M,stat,o_r1,VdO}; end // set status register to 1 error 
st.65:begin nxst8 = st_67; fnk = {4'h1,load,oj1,1'd0}; end // noop unconditional branch 



st.66:begin rixst8«st_69; fnk« {4'h0.stat,o_rO,1*dO}; end // decoder failure: set status to 0 

st.67:begin nxst8«st.68; fnk = {Lr1,load.o_r4,1'dO}; end // load rl value into r4 locater output 

st,68:begin nxst8 = st_69; fnk = {i_sO,load,o_r6,1'dO}; end // load SO value into r4 locater output 

st.69:begin nxst8 = st_69; fnk = {4'h0,load,o_r0,1'd1}; end // decoder complete 
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endcase 

always @ Iposedge constclk or negedge resetn) 
begin 

if (Iresetn) 

st8 < = st.69; 
else if (finished && decstart) 
st8 < « 7'dO; 

else 

st8 < = nxst8; 

end 

gf.alu mylstalu( 

.resetn (resetn), 

.syrsOOOCNsyrsOOOO), 

.syrsOOOKsyrsOOOl), 

.syrs0002<syrs0002), 

.syrs0003|syrs0003), 

.syrs0004(syrs0004) , 

.syrs0005 (syrs0005), 

.syrs0006(syrs0006), 

.syrs0007(syrs0007), 

.locaterO(locaterO), 

.locaterl (locaterl). 
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.errvaluO(errvaluO), 

.errvalul(errvalul), 

.decodclk(constclk), 

.instruct(instruct), 

.sourcadd(sourcadd), 

.destnadd(destnadd), 

.decrstat(decrstat), 

.instatus(instatus); 

endmodule 



module gf.alu (syrsOOOO, 


/+ syndrome 0 




*/ 


syrsOOOl, 


/* syndrome 1 




*/ 


syrs0002, 


/* syndrome 2 




*/ 


syrs0003, 


/+ syndrome 3 




*/ 


syrs0004, 


/* syndrome 4 




*/ 


syrsOOOS, 


/+ syndrome 5 




*/ 


syrs0006. 


/* syndrome 6 




w 


syrs0007, 


/* syndrome 7 




*/ 


locaterO, 


/* register 4 of ALU 


*/ 




locaterl , 


/+ register 5 of ALU 


*/ 




errvaluO, 


/* register 6 of ALU 


*/ 




errvalul, 


/* register 7 of ALU 


*/ 




decodclk, 


/★ alu positive edge clock 20 Mhz 


*/ 




instruct, 


/+ instruction to be executed 


W 
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input 
input 
input 
input 
input 
input 
input 
input 
input 
input 
input 
input 
input 



[9:0] 
[9:0] 
[9:0] 
19:0] 
[9:0] 
[9:0] 
[9:0] 
[9:0] 

[2:0] 
(3:0) 
[2:0] 



sourcadd, 
destnadd, 

instatus 

); 

resetn; 

syrsOOOO; 

syrsOOOl; 

syrs0O02; 

syrs0003; 

syrs0004; 

syrs0005; 

syrs0006; 

syrs0007; 

decodclk; 

instruct; 

sourcadd; 

destnadd; 



/★ address of source operand 
/+ address of destination operand 
decrstat, /+ status of decoder 

/* status of completed instruction 



*/ 
*/ 



output [9:0] locaterO; 

output [9:0] locaterl; 

output [9:0] errvaiuO; 

output [9:0] errvalul; 

output [1:0] decrstat; 
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output instaxus; 



//reg 




instatus; 






reg 


19:0] 


workregO; 






reg 


[1:0] 


decrstat; 






reg 


19:0] 


workregl; 






reg 


[9:0} 


workreg2; 






reg 


[9:0] 


workreg3; 






reg 


[9:0] 


workreg4; 






reg 


19:0] 


workregS; 






reg 


[9:0] 


workreg6; 






reg 


19:0] 


workreg7; 






reg 




instatus; 






reg 


19:0] 


sourcer; 






reg 


[9:0] 


aluout; 






reg 




status; 












// instruction 0 destination < « 


source 


wire 


[9:0] 


accumul; 


// instruction 1 destination < » 


sourcer accumulated w output 0 


wire 


[9:0] 


product; 


// instruction 2 destination < » 


sourcer multiplied by output 1 


wire 


[9:0] 


inverse; 


// instruction 3 destination < m 


inverse of sourcer 


wire 


[9:0] 


dblsoln; 


II instruction 4 destination < - 


soln of {y**2 + y + sourcer = = 


wire 


19:0] 


squared; 


II instruction 5 destination < = 


sourcer*+2 



0) 



// instruction 6 no operation 
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wire iszero; 

wire rioroot; 

wire statenab; 

wire [7:01 regenab; 

assign locaterO = workreg4; 
assign iocaxerl = workregS; 
assign errvaluO = workreg6; 
assign errvalul « workreg7; 

/* get source operand */ 



// instruction 7 no operation 



always @lsourcadd or 

workregO or workregl or workreg2 or workreg3 or 
workreg4 or workregB or workreg6 or workreg7 or 
syrsOOOO or syrsOOO! or syrs0002 or syrs0O03 or 
syrs0004 or syrsOOOS or syrsO0O6 or syrs0007) 

begin 

case (sourcadd) 
4'dO : sourcer = workregO; 
4'd1 : sourcer » workregl; 
4'd2 : sourcer = workreg2; 
4'd3 : sourcer ■ workreg3; 
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4'd4 : sourcer m workreg4 
4'd5 : sourcer = workreg5; 
4'd6 : sourcer = workreg6; 
4'd7 : sourcer » workreg7; 
4'd8 : sourcer = syrsOOOO; 
4'd9 : sourcer = syrsOOOl; 
4'd10 : sourcer = syrs0002; 
4'd1 1 : sourcer « syrs0003; 
4*d1 2 : sourcer » syrs0004; 
4'd13 : sourcer = syrs0005; 
4'd14 : sourcer = syrs0006; 
4'd15 : sourcer = syrs0007; 
endcase 

end 



/+ computation units */ 



assign 

multiply 

inverse 

dbltrbl 

squarer 



accumul = workregO A sourcer; 
decmultr(.x(workreg1| i .y(sourcer), .z(product)); 

decinvrt(.y{sourcer). ,z(inverseM; 
dectable(.c(sourcer), .y(dblsoln), .noroot)); 

decsquar(.x(sourcer), .z(squared)); 



/* assign alu output and status +/ 
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always ©(instruct or 
sourcer or 
accumul or 
product or 
inverse or 
dbisoln or 
squared or 
noroot or 
iszero) 

begin 

case (instruct) 
3'dO : begin aluout = sourcer; status - iszero; end 
3'd1 : begin aluout = accumul; status « iszero; end 
3'd2 : begin aluout = product; status - iszero; end 
3'd3 : begin aluout « inverse; status » iszero; end 
3'd4 : begin aluout = dbisoln; status = noroot; end 
3'd5 : begin aluout = squared; status « iszero; end 
3'd6 : begin aluout « sourcer; status - 1'dO; end 
3'd7 : begin aluout - 9'dO; status - VdO; end 
endcase 

end 

/* assign iszero as zero output of alu */ 



- 55 - 



WO 97/00559 



PCT/US96/10074 



assign iszero = ~(| aluout); 



/* assign output register enables */ 



assign statenab = (instruct = = 3'd6); 



5 



10 



assign regenabtOl 




(destnadd 


= s 


3'dO); 


assign regenablU 




(destnadd 


t= = 


3'<J1); 


assign regenab[2J 




(destnadd 




3'd2): 


assign regenabI3] 




(destnadd 




3'd3); 


assign regenab[4] 


= 


(destnadd 


ss = 


3'd4); 


assign regenab(51 




(destnadd 


SB = 


3'd5); 


assign regenab[6] 




(destnadd 


B = 


3'd6); 


assign regenabl7] 




(destnadd 


B S= 


3'd7); 



/* update outputisters */ 

always @ (posedge decodclk) 
begin 

//Note: add reset for workregO only, since a known value is 
//needed here during initialization, 
if (resetn * « 0) 

workregO < = 10'hOOO; 

else 
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workregO < = ({10{regenab[0J}} & aluout | |{10{~regenab[0]}} & workregO); 
workregl <= ({10{regenab[1]}} & aluout | in0{-regenab(1)}} & workregl); 
workreg2 < = {{10{regenab[2}}} & aluout | {[10{~regenabI2]}) & workreg2>; 
workreg3 <- <{10{regenab[3])} & aluout | (M0{~regenab[3]}} & workreg3); 
workreg4 < = {{10{regenabf4]}} & aluout | ([10{~regenab[4]}} & workreg4); 
workreg5 < = ({10{regenab(5]}> & aluout | (I10{~regenab[5)}} & workregS); 
workreg6 < = <{10{regenab[6]}} & aluout | |[10{-regenab[6]}} & workreg6); 
workreg7 <= ({10{regenab[7]}} & aluout | |[10{-regenab[7]}} & workreg7); 
instatus < = status; 

if (statenab) 

decrstat <= sourcadd[1:OJ; 

end 
endmodule 

module multiply U, y, 2); // galois field multiplier of x and y to get z 

input {9:01 x; 
input [9:0) y; 
input [9:0J z; 

wire (4:0] IsbO; 
wire [4:01 msbO; 
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wire (4:0) 1sb1; 
wire [4:0J msbl; 
wire [4:0] msbOmsbl; 
wire (4:0) UbOlsbl; 
wire f4;0] msbOlsbl; 
wire {4:01 msbllsbO; 
wire (4:0J pOmOml; 
wire (4:01 pi mOml; 

assign IsbO - x(4:0); 
assign 1sb1 « y[4:0); 
assign msbO » x[9:5J; 
assign msbl = y[9:5J; 

gatemu!5 subm0{ .xdsbO), .yMsbl), ,z(1sb01sb1 )); 
gatemu!5 submK .x(msbO), .y(msb1>, .z(msbOmsbl)); 
gatemu!5 subm2( .xdsbO), ,y(msb1), .zdsbl 1sbO)); 
gatemu!5 subm4( .x(msbO), .ydsbl), .z(msbOUbl)); 
assign pOmOml = msbOmsbl; 
assign pi mOml * msbOmsbl; 

assign z[4:0] = 1sbOlsb1 A p0mOm1; 

assign z{9:5] - msbOlsbl * msbllsbO A plmOml; 
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endmodule 

module gatemu!5(x, y, z); // gafois field multiplier of x and y to get z 

input 14:0) x; 
input (4:0) y; 
input (4:0) zx; 

wire (4:0] alphaOx; 
wire 14:0] alpha Ix; 
wire [4:0] alpha2x; 
wire [4:0] alpha3x; 
wire [4:0] alpha4x; 

assign alphaOx « x; 

assign alphalx = {alpha0x[3:0],1'b0} * <{5{alpha0xf4]}} & 5'd5}; 
assign alpha2x = {alpha1x[3:0],1'b0} * <{5{alpha1x(4]}} & 5'd5); 
assign alpha3x = {alpha2x[3:OJ,1'bO} A ({5{alpha2x[4]}} & 5'd5); 
assign alpha4x = {alpha3x(3:0l,1'b0} A ({5{alpha3x[4]}} & 5'd5); 

assign z[0] *» 

(alphaOx(O) & y(0]) A 
<alpha1x[0] & yflj) * 
(alpha2x(0] & yl2]) * 
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<alpha3x[0] & y[3]| * 
(alpha4x[01 & y[4J); 

assign z[1] = 

<alphaOxf1]&ylO]) A 
(alphalxH) & y(U) * 
<alpha2x[1) & y[2]) A 
(alpha3xt1l & y[3)) A 
(alph34x[1] &y(4)); 

assign 2(21 = 

Ialpha0x[2] & y(01) A 
(alpha1x(2l & y[1J) * 
(alpha2xf2J & y[2]> A 
(alpha3x[2J & y[3]) * 
(alpha4x[2J & y[4]); 

assign z[3J « 

<alphaOx[3) & y(0]) A 
(alpha! xf3] & y[lj) A 
(alpha2x(31 & y[2J) A 
(alpha3xf3J & y[3]> A 
(alpha4x[3) & y[4)); 
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assign z[4] ■= 

Ialpha0x[4] & ylO)) A 
(alpha1x[4) & vtD) * 
Ia!pha2x[4] & y[2J) * 
(alpha3x(4J & y[31) A 
(alpha4x(4) & y(4]); 



endmodule 



module squarer(x ( z); 
input (9:0] x; 
output {9:01 z; 

assign z « ({10{x(01}} & 10'dl) 
({10{x(1]}} & 10'd4) 
({10{x[2]}} & 10'd16) 
({10{x[3]}} & 10'd10> 
({10{xl4l}} & 10'd13) 
{{10{x(5]}} & 10'd33) 
U10{x[6}}} & 10'dl32> 
{{10{xm}} & 10'd528) 
({10{x(8]}} & 10'd330) 
!{10{x(9]}) & 10'd429) 

endmodule 
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module inverse (y, 2); // galois field inverter of y to get 2 

input ' [9:0) y; 
input 19:01 2; 

wire 14:0] 1sb; 
5 wire 14:0] msb; 

wire (4:01 lsblsb; 
wire (4:0J msbmsb; 
wire (4:0] Isbmsb; 
wire (4:0] ptmsb; 
10 wire [4:0] denom; 
wire [4:0) numer; 
wire [4:0] pOmsbmsb; 
wire [4:0] pllsbmsb; 
wire (4:0) Htlinvt; - 

15 assign 1sb « y[4:0); 

assign msb = y[9:5]; 

gatemulS subm0( .x(1sb), .ydsb), .2<1sb01sb»; 
gatemu!5 subrnK .x(msb), .y(msb), ,2{msbmsb]); 
assign plmsb = msb; 
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gatemul5 subm4{ .x(p1msb), .y(1sb), .z(p1 Isbmsb)); 
assign pOmsbmsb = msbmsb; 

assign number = 1sb~p1msb; 

assign denom = 1sb1sb A pOmsbmsb *p1 Isbmsb; 

5 gf32inv invO(.xldenom), .zllitlinvt)); 

gatemu!5 subm5( .xllitlinvt), .y(numer), .z|[4:0])); 
gatemulS subm6( .xditlinvt), .y(msb), .z|z[9:5])); 

endmodule 

module gf32inv(x, z); 
10 input 14:0] x; 

output [4:0] z; 

reg 14:0) z; 
always @(x) 



begin 



15 



case \x) 



5,b00000 : z 



5'bOOOOO; 



5,b00001 : z 



5'b00001 



5,b00010 : z 



5'b10010; 



5,b0001 1 : z 



5'b1l100; 
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5,b00100 : 2 = 5'b01001; 

5,b00101 : z = 5'b101 11; 

5,b00110 : 2 = 5'b01110; 

5,b00111 : 2 *= 5'b01100; 

5,b01000 ; 2 m 5'b10110; 

5,b01001 : 2 » 5'b00100; 

5,b01010 : 2 » 5'b11001; 

5,b01011 : 2 = 5'b10000; 

5,b01100 : 2 « 5'b00111; 

5,b01101 : 2 « 5'b01111; 

5,b01110 : 2 « 5'b00110; 

5,b01111 : 2 = 5'b01101; 

5,b10000 : 2 = 5*b01011; 

5,b10001 : 2 « 5'bllOOO; 

5,b10010 : 2 = 5'bOOOIO; 
5,b10011 : 2 = 5'b11101; 
5,b10100 : 2 = 5'b11110; 
5,b10101 : 2 = 5'b11010; 
5,b10110 : 2 = 5'b01000; 
5,b10111 : 2 = 5'b00101; 
5,b11000 : 2 = 5'b10001; 
5,b11001 : 2 = 5'b01010; 
5,b11010 ; 2 = 5'b10101; 
5,b11011 : 2 « 5*b1 1111; 
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5,b11100 : 2 = 5'b00011 
5,b11101 : z m 5'b1001 1; 
5,b11110 : 2 » 5'b10100; 
5,b1 1111 : 2 = 5'bl 101 1 ; 
endcase 
end 

endmodule 

module dbltrbl (c, y, noroot); 

input 19:0] c; 

output (9:0 y; 

output noroot; 

assign noroot = c(81 A c(5]; 



assign y(90] 




VdO 








assign y[1] 


— 


c[7J * 


c(4] * 


cl2); 




assign y[2] 




c[7] * 


c(61 * 


c|4] * 


c[3J; 


assign y(3] 




c[9] * 


cf6) * 


cl4] * 


c(2) 


assign y[7J 




cf9I A 


c[5); 






assign y(9] 




c[5I; 








assign y[6] 




c[9] A 


c[7]; 






assign y[8] 




V(6J * 


c[6J; 






assign y(4] 




y(6] * 


cI6] * 


cl$) A 


cf3J; 


assign y(5I 




y[81 A 


c[3J A 


cK>]; 




endmodule 
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Thus, an error correction methodology and system architecture 
for implementing a 10-bit Reed-Solomon code has been disclosed. 
While the foregoing detailed description was directed to a 
semiconductor based data storage and retrieval system, it will be 
apparent to those skilled in the art that the described error 
correction methodology and architecture can be effectively 
practiced with any digital data storage system, e.g., such as 
magnetic or optical based memory systems, as well as with any 
system involving the handling, transfer, and/or storage of blocks 
of digital data between elements thereof, including digital 
communications systems, where the correction of data being 
transmitted or stored is required. While the present invention is 
particularly well suited for systems and architectures encountering 
non-bursty, substantially random data errors, it may be equally 
employed in those systems and architectures encountering bursty 
error patterns, as well. 

Thus, it would be apparent to those skilled in the art that 
many more modifications are possible without departing from the 
inventive concepts herein. The invention, therefore is not to be 
restricted except in the spirit of the appended claims. 
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What is claimed is: 

1. A data storage and retrieval system for receiving, 
storing and retrieving, respectively, digital data paketized in 
blocks of k m-bit data symbols, comprising: 

means for reformatting the blocks of m-bit data symbols into 
blocks of 10-bit data symbols; 

an encoder having means for generating r 10-bit error check 
symbols from each reformatted block of 10-bit data symbols; 

means for reformatting said r 10-bit error check symbols 
generated by said encoder into (10/m)*r m-bit error check symbols; 

means for appending said (10/m)*r m-bit error check symbols to 
the respective m-bit data block from which they were generated; and 
data storage means for receiving and storing, respectively, 
said m-bit data blocks, including the respective appended (10/m)*r 
m-bit error check symbols. 

2. The data storage and retrieval system of claim 1, further 
comprising means for retrieving said stored m-bit data blocks from 
said data storage means and reformatting them into respective 10- 
bit code words, each retrieved 10-bit code word comprising k 10-bit 
data symbols and r 10-bit error check symbols. 
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3. The data storage and retrieval system of claim 2, further 
comprising 

a syndrome generator for generating r 10-bit error syndromes 
from each retrieved 10-bit code word; 

means for detecting the existence of at least one non-zero 
error syndrome in a given set of generated error syndromes; and 

error correction circuitry comprising: 

means for receiving said r 10-bit error syndromes from said 
syndrome generator when at least one non-zero syndrome is detected, 

a dedicated Galois field arithmetic logic unit 
having means for executing arithmetic operations on 10- 
bit symbols by performing extension field operations over 
a Galois field of 2 5 , 

a programmed controller for providing state 
instructions to said arithmetic logic unit which carry 
out a programmed error correction routine for determining 
the location of, and correction value for, respectively, 
up to 2 erroneous 10-bit data symbols in a given 
retrieved 10-bit code word. 



4. The data storage and retrieval system of claim 3, said 
dedicated Galois field arithmetic logic unit utilizing a Galois 
field GF(2 5 ) generated from the irreducible polynomial g(x) - x s + 
x 2 + l, over GF(2) . 
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5. The data storage and retrieval system of claim 4, said 
dedicated Galois field arithmetic logic unit further generating the 
Galois field GF(2 10 ) from the irreducible polynomial P l02 4 (y) = y 2 + 
y + 1, over GF(2 5 ) . 

6. The data storage and retrieval system of claim 5, said 
Galois field GF(2 10 ) having as its primitive element (101)h. 

7. The data storage and retrieval system of claim 3, said 
data storage means characterized in that data errors occurring in 
a block of data stored therein occur in substantially random bit 
locations. 

8. The data storage and retrieval system of claim 3, said 
data storage means comprising a semiconductor based memory. 

9. The data storage and retrieval system of claim 8, said 
semiconductor based memory comprising a FLASH random access memory. 

10. The data storage and retrieval system of claim 3, said 
encoder combined with said syndrome generator in a single circuit. 
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4 
5 
6 



9 



1 11. A data storage and retrieval system for receiving, 

2 storing and retrieving, respectively, data packetized in blocks of 

3 m-bit data symbols, comprising: 
an interface circuit for receiving the blocks of ro-bit data 

symbols from an external data processing system; 

a first buffer for temporarily storing the blocks of m-bit 

7 data symbols from said interface circuit; 

8 means for reformatting said blocks of m-bit data symbols 
received from said interface circuit into blocks of k 10-bit data 

10 symbols; 

11 an encoder circuit having means for generating r 10-bit error 
check symbols from each block of k 10-bit data symbols; 

means for reformatting said r 10-bit error check symbols 
generated by said encoder into (10/m)*r m-bit error check symbols, 
wherein said (lO/m)*r m-bit error check symbols are appended in 
said first buffer to the respective m-bit data block from which 

17 they were generated; 

18 data storage means for receiving and storing, respectively, 

19 said m-bit data blocks, including the respective appended m-bit 

20 error check symbols from said first buffer; 
means for retrieving said stored m-bit data blocks from said 

data storage means and reformatting them into respective 10-bit 
code words, each retrieved 10-bit code word comprising k 10-bit 
24 data symbols and r 10-bit error check symbols; 



12 
13 
14 
15 
16 



21 
22 
23 
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25 a syndrome generator for generating r 10-bit error syndromes 

26 from each retrieved 10-bit code word; 

27 ' means for detecting the existence of at least one non-zero 

28 error syndrome; and 

29 error correction circuitry comprising: 

30 means for receiving said r 10-bit error syndromes 

31 from said syndrome generator when at least one non-zero 

32 syndrome is detected, 

33 a dedicated Galois field arithmetic logic unit 

34 having means for executing arithmetic operations on 10- 

35 bit symbols by performing extension field operations over 

36 a Galois field of 2 5 , 

37 a programmed controller for providing state 

38 instructions to said arithmetic logic unit which carry 

39 out a programmed error correction routine for determining 

40 the location of, and correction value for, respectively, 

41 up to 2 erroneous 10-bit data symbols in a given 

4 2 retrieved 10-bit code word. 

1 12. The data storage and retrieval system of claim 11, said 

2 dedicated Galois field arithmetic logic unit utilizing a Galois 

3 field GF(2 5 ) generated from the irreducible polynomial g(x) = x 5 + 

4 x 2 + 1, over GF(2) . 
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13. The data storage and retrieval system of claim 12, said 
dedicated Galois field arithmetic logic unit further generating the 
Galois field GF(2 10 ) from the irreducible polynomial P I024 (y) = y 2 + 
y + 1, over GF(2 5 ), and having as its primitive element (101)h . 

14. The data storage and retrieval system of claim 11, 
further comprising a second buffer for receiving and temporarily 
storing, respectively, said retrieved m-bit data blocks from said 
data storage means. 

15. The data storage and retrieval system of claim 11, 
wherein r = 8. 

16. The data storage and retrieval system of claim 11, 
wherein (io/m)*k = 256, m - 16, and r = 8, respectively. 

17. The data storage and retrieval system of claim 11, 
wherein (10/m)*k = 512, m « 8, and r - 8, respectively. 
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18. In a data processing system, error correction apparatus 
for detecting and correcting data errors, comprising: 

means for receiving r 10-bit error syndromes generated from a 
10-bit Reed Solomon code word; 

a dedicated Galois field arithmetic logic unit having means 
for executing an error correction algorithm by performing 
arithmetic operations on 10-bit symbols; 

a programmed controller for providing state instructions to 
said arithmetic logic unit which carry out said error correction 
algorithm. 

19. The error correction apparatus set forth in claim 18, 
said dedicated Galois field arithmetic logic unit generating a 
Galois field GF{2 10 ) using the irreducible polynomial P l024 (y) = y 2 + 
y + 1, over a Galois field 2 5 , said Galois field 2 5 being generated 
using the irreducible polynomial g(x) = x 5 + x 2 + 1, over GF(2) . 

20. The data processing system in claim 18, further 
comprising a semiconductor based digital data storage and retrieval 
sub-system. 
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21. In a data processing system, error correction apparatus 
for detecting and correcting data errors, comprising: 

an arithmetic logic unit having means for executing an error 
correction algorithm on 10-bit error syndromes, said arithmetic 
logic unit utilizing a Galois field GF(2 5 ) generated from the 
irreducible polynomial g(x) « x 5 + x 1 + l, over GF(2) ; and 

a programmed controller for providing state instructions to 
said arithmetic logic unit which carry out said error correction 
algorithm. 

22. The data storage and retrieval system of claim 21, said 
arithmetic logic unit further utilizing a a Galois field GF(2 10 ) 
generated from the irreducible polynomial P ia24 (y) = y 2 + y + 1, over 
GF(2 5 ) . 
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